Overview
Brought to you by YData
Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 899164 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 323.4 MiB |
| Average record size in memory | 377.2 B |
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 12 |
| Text | 3 |
| DateTime | 1 |
RevLineCr has constant value "0" | Constant |
ApprovalFY is highly overall correlated with IsFranchise and 2 other fields | High correlation |
DisbursementGross is highly overall correlated with GrAppv and 2 other fields | High correlation |
GrAppv is highly overall correlated with DisbursementGross and 2 other fields | High correlation |
IsFranchise is highly overall correlated with ApprovalFY | High correlation |
RetainedJob is highly overall correlated with ApprovalFY | High correlation |
SBA_Appv is highly overall correlated with DisbursementGross and 2 other fields | High correlation |
Term is highly overall correlated with DisbursementGross and 2 other fields | High correlation |
UrbanRural is highly overall correlated with ApprovalFY | High correlation |
NoEmp is highly skewed (γ1 = 80.24824355) | Skewed |
CreateJob is highly skewed (γ1 = 36.99135473) | Skewed |
RetainedJob is highly skewed (γ1 = 36.85481184) | Skewed |
LoanNr_ChkDgt has unique values | Unique |
NAICS has 201948 (22.5%) zeros | Zeros |
CreateJob has 629248 (70.0%) zeros | Zeros |
RetainedJob has 440403 (49.0%) zeros | Zeros |
FranchiseCode has 208835 (23.2%) zeros | Zeros |
Reproduction
| Analysis started | 2025-02-10 10:13:52.176689 |
|---|---|
| Analysis finished | 2025-02-10 10:14:43.416760 |
| Duration | 51.24 seconds |
| Software version | ydata-profiling vv4.12.2 |
| Download configuration | config.json |
Variables
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 741345 | |
| 0 | 157819 | 17.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 741345 | |
| 0 | 157819 | 17.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 741345 | |
| 0 | 157819 | 17.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 741345 | |
| 0 | 157819 | 17.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 741345 | |
| 0 | 157819 | 17.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 741345 | |
| 0 | 157819 | 17.6% |
LoanNr_ChkDgt
Real number (ℝ)
Unique 
| Distinct | 899164 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.7726123 × 109 |
| Minimum | 1.000014 × 109 |
|---|---|
| Maximum | 9.996003 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 1.000014 × 109 |
|---|---|
| 5-th percentile | 1.3484572 × 109 |
| Q1 | 2.5897575 × 109 |
| median | 4.361439 × 109 |
| Q3 | 6.9046265 × 109 |
| 95-th percentile | 9.1648039 × 109 |
| Maximum | 9.996003 × 109 |
| Range | 8.995989 × 109 |
| Interquartile range (IQR) | 4.314869 × 109 |
Descriptive statistics
| Standard deviation | 2.538175 × 109 |
|---|---|
| Coefficient of variation (CV) | 0.53182091 |
| Kurtosis | -1.086499 |
| Mean | 4.7726123 × 109 |
| Median Absolute Deviation (MAD) | 2.0134 × 109 |
| Skewness | 0.3647571 |
| Sum | 4.2913612 × 1015 |
| Variance | 6.4423325 × 1018 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 9996003010 | 1 | < 0.1% |
| 1000014003 | 1 | < 0.1% |
| 1000024006 | 1 | < 0.1% |
| 1000034009 | 1 | < 0.1% |
| 1000044001 | 1 | < 0.1% |
| 1000054004 | 1 | < 0.1% |
| 1000084002 | 1 | < 0.1% |
| 1000093009 | 1 | < 0.1% |
| 1000094005 | 1 | < 0.1% |
| 1000104006 | 1 | < 0.1% |
| Other values (899154) | 899154 |
| Value | Count | Frequency (%) |
| 1000014003 | 1 | |
| 1000024006 | 1 | |
| 1000034009 | 1 | |
| 1000044001 | 1 | |
| 1000054004 | 1 | |
| 1000084002 | 1 | |
| 1000093009 | 1 | |
| 1000094005 | 1 | |
| 1000104006 | 1 | |
| 1000124001 | 1 |
| Value | Count | Frequency (%) |
| 9996003010 | 1 | |
| 9995973006 | 1 | |
| 9995613003 | 1 | |
| 9995603000 | 1 | |
| 9995573004 | 1 | |
| 9995563001 | 1 | |
| 9995493004 | 1 | |
| 9995473009 | 1 | |
| 9995453003 | 1 | |
| 9995423005 | 1 |
State
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 43.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IN |
|---|---|
| 2nd row | IN |
| 3rd row | IN |
| 4th row | OK |
| 5th row | FL |
| Value | Count | Frequency (%) |
| ca | 130621 | 14.5% |
| tx | 70462 | 7.8% |
| ny | 57693 | 6.4% |
| fl | 41213 | 4.6% |
| pa | 35170 | 3.9% |
| oh | 32622 | 3.6% |
| il | 29669 | 3.3% |
| ma | 25272 | 2.8% |
| mn | 24374 | 2.7% |
| nj | 24036 | 2.7% |
| Other values (41) | 428032 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 306178 | |
| C | 184959 | |
| N | 181729 | |
| M | 132551 | 7.4% |
| T | 125074 | 7.0% |
| I | 119519 | 6.6% |
| O | 94908 | 5.3% |
| L | 88820 | 4.9% |
| X | 70462 | 3.9% |
| Y | 68255 | 3.8% |
| Other values (14) | 425873 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1798328 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 306178 | |
| C | 184959 | |
| N | 181729 | |
| M | 132551 | 7.4% |
| T | 125074 | 7.0% |
| I | 119519 | 6.6% |
| O | 94908 | 5.3% |
| L | 88820 | 4.9% |
| X | 70462 | 3.9% |
| Y | 68255 | 3.8% |
| Other values (14) | 425873 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1798328 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 306178 | |
| C | 184959 | |
| N | 181729 | |
| M | 132551 | 7.4% |
| T | 125074 | 7.0% |
| I | 119519 | 6.6% |
| O | 94908 | 5.3% |
| L | 88820 | 4.9% |
| X | 70462 | 3.9% |
| Y | 68255 | 3.8% |
| Other values (14) | 425873 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1798328 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 306178 | |
| C | 184959 | |
| N | 181729 | |
| M | 132551 | 7.4% |
| T | 125074 | 7.0% |
| I | 119519 | 6.6% |
| O | 94908 | 5.3% |
| L | 88820 | 4.9% |
| X | 70462 | 3.9% |
| Y | 68255 | 3.8% |
| Other values (14) | 425873 |
Zip
Real number (ℝ)
| Distinct | 33611 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53804.391 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 283 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3838 |
| Q1 | 27587 |
| median | 55410 |
| Q3 | 83704 |
| 95-th percentile | 95822 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 56117 |
Descriptive statistics
| Standard deviation | 31184.159 |
|---|---|
| Coefficient of variation (CV) | 0.5795839 |
| Kurtosis | -1.3359893 |
| Mean | 53804.391 |
| Median Absolute Deviation (MAD) | 28206 |
| Skewness | -0.16816663 |
| Sum | 4.8378972 × 1010 |
| Variance | 9.7245178 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10001 | 933 | 0.1% |
| 90015 | 926 | 0.1% |
| 93401 | 806 | 0.1% |
| 90010 | 733 | 0.1% |
| 33166 | 671 | 0.1% |
| 90021 | 666 | 0.1% |
| 59601 | 640 | 0.1% |
| 65804 | 599 | 0.1% |
| 3801 | 581 | 0.1% |
| 59101 | 578 | 0.1% |
| Other values (33601) | 892031 |
| Value | Count | Frequency (%) |
| 0 | 283 | |
| 1 | 24 | < 0.1% |
| 2 | 11 | < 0.1% |
| 3 | 5 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 6 | < 0.1% |
| 8 | 15 | < 0.1% |
| 9 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 209 | |
| 99950 | 3 | < 0.1% |
| 99929 | 15 | < 0.1% |
| 99928 | 1 | < 0.1% |
| 99926 | 1 | < 0.1% |
| 99925 | 4 | < 0.1% |
| 99923 | 1 | < 0.1% |
| 99921 | 13 | < 0.1% |
| 99919 | 2 | < 0.1% |
| 99918 | 1 | < 0.1% |
UrbanRural
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.9 MiB |
| 1 | |
|---|---|
| 0 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Bank
Text
| Distinct | 5803 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.9 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 26 |
| Mean length | 23.159879 |
| Min length | 3 |
Unique
| Unique | 923 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | FIFTH THIRD BANK |
|---|---|
| 2nd row | 1ST SOURCE BANK |
| 3rd row | GRANT COUNTY STATE BANK |
| 4th row | 1ST NATL BK & TR CO OF BROKEN |
| 5th row | FLORIDA BUS. DEVEL CORP |
| Value | Count | Frequency (%) |
| bank | 651608 | |
| natl | 318240 | 9.0% |
| assoc | 306768 | 8.7% |
| of | 142852 | 4.1% |
| national | 125899 | 3.6% |
| america | 100686 | 2.9% |
| association | 84965 | 2.4% |
| fargo | 63732 | 1.8% |
| wells | 63650 | 1.8% |
| 52264 | 1.5% | |
| Other values (3603) | 1608268 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2762231 | |
| 2620014 | ||
| N | 2105500 | |
| S | 1520499 | 7.3% |
| O | 1336993 | 6.4% |
| T | 1181841 | 5.7% |
| C | 1134642 | 5.4% |
| I | 1061717 | 5.1% |
| E | 923739 | 4.4% |
| L | 922583 | 4.4% |
| Other values (44) | 5254770 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 20824529 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 2762231 | |
| 2620014 | ||
| N | 2105500 | |
| S | 1520499 | 7.3% |
| O | 1336993 | 6.4% |
| T | 1181841 | 5.7% |
| C | 1134642 | 5.4% |
| I | 1061717 | 5.1% |
| E | 923739 | 4.4% |
| L | 922583 | 4.4% |
| Other values (44) | 5254770 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 20824529 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 2762231 | |
| 2620014 | ||
| N | 2105500 | |
| S | 1520499 | 7.3% |
| O | 1336993 | 6.4% |
| T | 1181841 | 5.7% |
| C | 1134642 | 5.4% |
| I | 1061717 | 5.1% |
| E | 923739 | 4.4% |
| L | 922583 | 4.4% |
| Other values (44) | 5254770 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 20824529 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 2762231 | |
| 2620014 | ||
| N | 2105500 | |
| S | 1520499 | 7.3% |
| O | 1336993 | 6.4% |
| T | 1181841 | 5.7% |
| C | 1134642 | 5.4% |
| I | 1061717 | 5.1% |
| E | 923739 | 4.4% |
| L | 922583 | 4.4% |
| Other values (44) | 5254770 |
BankState
Text
| Distinct | 57 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 43.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.0087081 |
| Min length | 2 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OH |
|---|---|
| 2nd row | IN |
| 3rd row | IN |
| 4th row | OK |
| 5th row | FL |
| Value | Count | Frequency (%) |
| ca | 118116 | 13.1% |
| nc | 79514 | 8.8% |
| il | 65908 | 7.3% |
| oh | 58461 | 6.5% |
| sd | 51095 | 5.7% |
| tx | 47790 | 5.3% |
| ri | 45366 | 5.0% |
| ny | 39592 | 4.4% |
| va | 29002 | 3.2% |
| de | 24537 | 2.7% |
| Other values (47) | 339783 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 241398 | |
| C | 229604 | |
| N | 187751 | |
| I | 158854 | 8.8% |
| O | 102604 | 5.7% |
| L | 96914 | 5.4% |
| D | 96078 | 5.3% |
| T | 94941 | 5.3% |
| M | 85034 | 4.7% |
| S | 73385 | 4.1% |
| Other values (18) | 439595 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1806158 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 241398 | |
| C | 229604 | |
| N | 187751 | |
| I | 158854 | 8.8% |
| O | 102604 | 5.7% |
| L | 96914 | 5.4% |
| D | 96078 | 5.3% |
| T | 94941 | 5.3% |
| M | 85034 | 4.7% |
| S | 73385 | 4.1% |
| Other values (18) | 439595 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1806158 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 241398 | |
| C | 229604 | |
| N | 187751 | |
| I | 158854 | 8.8% |
| O | 102604 | 5.7% |
| L | 96914 | 5.4% |
| D | 96078 | 5.3% |
| T | 94941 | 5.3% |
| M | 85034 | 4.7% |
| S | 73385 | 4.1% |
| Other values (18) | 439595 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1806158 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 241398 | |
| C | 229604 | |
| N | 187751 | |
| I | 158854 | 8.8% |
| O | 102604 | 5.7% |
| L | 96914 | 5.4% |
| D | 96078 | 5.3% |
| T | 94941 | 5.3% |
| M | 85034 | 4.7% |
| S | 73385 | 4.1% |
| Other values (18) | 439595 |
NAICS
Real number (ℝ)
Zeros 
| Distinct | 1312 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 398660.95 |
| Minimum | 0 |
|---|---|
| Maximum | 928120 |
| Zeros | 201948 |
| Zeros (%) | 22.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 235210 |
| median | 445310 |
| Q3 | 561730 |
| 95-th percentile | 811192 |
| Maximum | 928120 |
| Range | 928120 |
| Interquartile range (IQR) | 326520 |
Descriptive statistics
| Standard deviation | 263318.31 |
|---|---|
| Coefficient of variation (CV) | 0.66050691 |
| Kurtosis | -1.0476526 |
| Mean | 398660.95 |
| Median Absolute Deviation (MAD) | 176300 |
| Skewness | -0.26287834 |
| Sum | 3.5846157 × 1011 |
| Variance | 6.9336534 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 201948 | 22.5% |
| 722110 | 27989 | 3.1% |
| 722211 | 19448 | 2.2% |
| 811111 | 14585 | 1.6% |
| 621210 | 14048 | 1.6% |
| 624410 | 10111 | 1.1% |
| 812112 | 9230 | 1.0% |
| 561730 | 8935 | 1.0% |
| 621310 | 8733 | 1.0% |
| 812320 | 7894 | 0.9% |
| Other values (1302) | 576243 |
| Value | Count | Frequency (%) |
| 0 | 201948 | |
| 111110 | 32 | < 0.1% |
| 111120 | 3 | < 0.1% |
| 111130 | 1 | < 0.1% |
| 111140 | 94 | < 0.1% |
| 111150 | 49 | < 0.1% |
| 111160 | 2 | < 0.1% |
| 111191 | 3 | < 0.1% |
| 111199 | 7 | < 0.1% |
| 111211 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 928120 | 32 | |
| 928110 | 4 | < 0.1% |
| 927110 | 1 | < 0.1% |
| 926150 | 10 | < 0.1% |
| 926140 | 6 | < 0.1% |
| 926130 | 3 | < 0.1% |
| 926120 | 5 | < 0.1% |
| 926110 | 6 | < 0.1% |
| 925120 | 1 | < 0.1% |
| 925110 | 3 | < 0.1% |
NoEmp
Real number (ℝ)
Skewed 
| Distinct | 599 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.411353 |
| Minimum | 0 |
|---|---|
| Maximum | 9999 |
| Zeros | 6631 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 10 |
| 95-th percentile | 40 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 74.108196 |
|---|---|
| Coefficient of variation (CV) | 6.4942514 |
| Kurtosis | 7965.2886 |
| Mean | 11.411353 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 80.248244 |
| Sum | 10260678 |
| Variance | 5492.0248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 154254 | |
| 2 | 138297 | |
| 3 | 90674 | |
| 4 | 73644 | 8.2% |
| 5 | 60319 | 6.7% |
| 6 | 45759 | 5.1% |
| 10 | 31536 | 3.5% |
| 7 | 31495 | 3.5% |
| 8 | 31361 | 3.5% |
| 12 | 20822 | 2.3% |
| Other values (589) | 221003 |
| Value | Count | Frequency (%) |
| 0 | 6631 | 0.7% |
| 1 | 154254 | |
| 2 | 138297 | |
| 3 | 90674 | |
| 4 | 73644 | |
| 5 | 60319 | 6.7% |
| 6 | 45759 | 5.1% |
| 7 | 31495 | 3.5% |
| 8 | 31361 | 3.5% |
| 9 | 18131 | 2.0% |
| Value | Count | Frequency (%) |
| 9999 | 4 | |
| 9992 | 1 | < 0.1% |
| 9945 | 1 | < 0.1% |
| 9090 | 1 | < 0.1% |
| 9000 | 2 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8041 | 1 | < 0.1% |
| 8018 | 1 | < 0.1% |
| 8000 | 7 | |
| 7999 | 1 | < 0.1% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 645903 | |
| 2.0 | 253261 | 28.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 645903 | |
| 2.0 | 253261 | 28.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 899164 | |
| 0 | 899164 | |
| 1 | 645903 | |
| 2 | 253261 | 9.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2697492 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 899164 | |
| 0 | 899164 | |
| 1 | 645903 | |
| 2 | 253261 | 9.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2697492 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 899164 | |
| 0 | 899164 | |
| 1 | 645903 | |
| 2 | 253261 | 9.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2697492 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 899164 | |
| 0 | 899164 | |
| 1 | 645903 | |
| 2 | 253261 | 9.4% |
CreateJob
Real number (ℝ)
Skewed  Zeros 
| Distinct | 246 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.4303764 |
| Minimum | 0 |
|---|---|
| Maximum | 8800 |
| Zeros | 629248 |
| Zeros (%) | 70.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 10 |
| Maximum | 8800 |
| Range | 8800 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 236.68817 |
|---|---|
| Coefficient of variation (CV) | 28.075634 |
| Kurtosis | 1369.911 |
| Mean | 8.4303764 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 36.991355 |
| Sum | 7580291 |
| Variance | 56021.288 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 629248 | |
| 1 | 63174 | 7.0% |
| 2 | 57831 | 6.4% |
| 3 | 28806 | 3.2% |
| 4 | 20511 | 2.3% |
| 5 | 18691 | 2.1% |
| 10 | 11602 | 1.3% |
| 6 | 11009 | 1.2% |
| 8 | 7378 | 0.8% |
| 7 | 6374 | 0.7% |
| Other values (236) | 44540 | 5.0% |
| Value | Count | Frequency (%) |
| 0 | 629248 | |
| 1 | 63174 | 7.0% |
| 2 | 57831 | 6.4% |
| 3 | 28806 | 3.2% |
| 4 | 20511 | 2.3% |
| 5 | 18691 | 2.1% |
| 6 | 11009 | 1.2% |
| 7 | 6374 | 0.7% |
| 8 | 7378 | 0.8% |
| 9 | 3330 | 0.4% |
| Value | Count | Frequency (%) |
| 8800 | 648 | |
| 5621 | 1 | < 0.1% |
| 5199 | 1 | < 0.1% |
| 5085 | 1 | < 0.1% |
| 3500 | 1 | < 0.1% |
| 3100 | 1 | < 0.1% |
| 3000 | 4 | < 0.1% |
| 2515 | 1 | < 0.1% |
| 2140 | 1 | < 0.1% |
| 2020 | 1 | < 0.1% |
RetainedJob
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 358 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.797257 |
| Minimum | 0 |
|---|---|
| Maximum | 9500 |
| Zeros | 440403 |
| Zeros (%) | 49.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 20 |
| Maximum | 9500 |
| Range | 9500 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 237.1206 |
|---|---|
| Coefficient of variation (CV) | 21.961188 |
| Kurtosis | 1362.0182 |
| Mean | 10.797257 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 36.854812 |
| Sum | 9708505 |
| Variance | 56226.179 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 440403 | |
| 1 | 88790 | 9.9% |
| 2 | 76851 | 8.5% |
| 3 | 49963 | 5.6% |
| 4 | 39666 | 4.4% |
| 5 | 32627 | 3.6% |
| 6 | 23796 | 2.6% |
| 7 | 16530 | 1.8% |
| 8 | 15698 | 1.7% |
| 10 | 15438 | 1.7% |
| Other values (348) | 99402 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 440403 | |
| 1 | 88790 | 9.9% |
| 2 | 76851 | 8.5% |
| 3 | 49963 | 5.6% |
| 4 | 39666 | 4.4% |
| 5 | 32627 | 3.6% |
| 6 | 23796 | 2.6% |
| 7 | 16530 | 1.8% |
| 8 | 15698 | 1.7% |
| 9 | 8735 | 1.0% |
| Value | Count | Frequency (%) |
| 9500 | 1 | < 0.1% |
| 8800 | 648 | |
| 7250 | 1 | < 0.1% |
| 5000 | 1 | < 0.1% |
| 4441 | 1 | < 0.1% |
| 4000 | 2 | < 0.1% |
| 3900 | 1 | < 0.1% |
| 3860 | 1 | < 0.1% |
| 3225 | 1 | < 0.1% |
| 3200 | 1 | < 0.1% |
FranchiseCode
Real number (ℝ)
Zeros 
| Distinct | 2768 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2753.7259 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 208835 |
| Zeros (%) | 23.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 15805 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 12758.019 |
|---|---|
| Coefficient of variation (CV) | 4.6330025 |
| Kurtosis | 24.409524 |
| Mean | 2753.7259 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.9752152 |
| Sum | 2.4760512 × 109 |
| Variance | 1.6276705 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 638554 | |
| 0 | 208835 | 23.2% |
| 78760 | 3373 | 0.4% |
| 68020 | 1921 | 0.2% |
| 50564 | 1034 | 0.1% |
| 21780 | 1003 | 0.1% |
| 25650 | 715 | 0.1% |
| 79140 | 659 | 0.1% |
| 22470 | 615 | 0.1% |
| 17998 | 606 | 0.1% |
| Other values (2758) | 41849 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 208835 | 23.2% |
| 1 | 638554 | |
| 3 | 12 | < 0.1% |
| 395 | 5 | < 0.1% |
| 399 | 3 | < 0.1% |
| 400 | 2 | < 0.1% |
| 401 | 12 | < 0.1% |
| 404 | 1 | < 0.1% |
| 407 | 34 | < 0.1% |
| 414 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 1 | < 0.1% |
| 92006 | 4 | < 0.1% |
| 92000 | 9 | |
| 91999 | 11 | |
| 91450 | 2 | < 0.1% |
| 91446 | 1 | < 0.1% |
| 91443 | 2 | < 0.1% |
| 91435 | 1 | < 0.1% |
| 91424 | 1 | < 0.1% |
| 91423 | 2 | < 0.1% |
IsFranchise
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.9 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 690329 | |
| 0 | 208835 | 23.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 690329 | |
| 0 | 208835 | 23.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 690329 | |
| 0 | 208835 | 23.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 690329 | |
| 0 | 208835 | 23.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 690329 | |
| 0 | 208835 | 23.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 690329 | |
| 0 | 208835 | 23.2% |
Term
Real number (ℝ)
High correlation 
| Distinct | 412 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110.77308 |
| Minimum | 0 |
|---|---|
| Maximum | 569 |
| Zeros | 810 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 60 |
| median | 84 |
| Q3 | 120 |
| 95-th percentile | 300 |
| Maximum | 569 |
| Range | 569 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 78.857305 |
|---|---|
| Coefficient of variation (CV) | 0.7118815 |
| Kurtosis | 0.18570424 |
| Mean | 110.77308 |
| Median Absolute Deviation (MAD) | 33 |
| Skewness | 1.1209258 |
| Sum | 99603164 |
| Variance | 6218.4746 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 84 | 230162 | |
| 60 | 89945 | 10.0% |
| 240 | 85982 | 9.6% |
| 120 | 77654 | 8.6% |
| 300 | 44727 | 5.0% |
| 180 | 28164 | 3.1% |
| 36 | 19800 | 2.2% |
| 12 | 17095 | 1.9% |
| 48 | 15621 | 1.7% |
| 72 | 9419 | 1.0% |
| Other values (402) | 280595 |
| Value | Count | Frequency (%) |
| 0 | 810 | 0.1% |
| 1 | 1608 | |
| 2 | 1809 | |
| 3 | 2112 | |
| 4 | 2173 | |
| 5 | 1866 | |
| 6 | 3054 | |
| 7 | 1761 | |
| 8 | 1693 | |
| 9 | 1875 |
| Value | Count | Frequency (%) |
| 569 | 1 | |
| 527 | 1 | |
| 511 | 1 | |
| 505 | 1 | |
| 481 | 1 | |
| 480 | 1 | |
| 461 | 1 | |
| 449 | 1 | |
| 445 | 1 | |
| 443 | 1 |
RevLineCr
Categorical
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.9 MiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 899164 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 899164 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 899164 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 899164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 899164 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 899164 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 788829 | |
| 1 | 110335 | 12.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 788829 | |
| 1 | 110335 | 12.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 788829 | |
| 1 | 110335 | 12.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 788829 | |
| 1 | 110335 | 12.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 788829 | |
| 1 | 110335 | 12.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 899164 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 788829 | |
| 1 | 110335 | 12.3% |
DisbursementGross
Real number (ℝ)
High correlation 
| Distinct | 118859 |
|---|---|
| Distinct (%) | 13.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201154.02 |
| Minimum | 0 |
|---|---|
| Maximum | 11446325 |
| Zeros | 196 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10000 |
| Q1 | 42000 |
| median | 100000 |
| Q3 | 238000 |
| 95-th percentile | 761892.5 |
| Maximum | 11446325 |
| Range | 11446325 |
| Interquartile range (IQR) | 196000 |
Descriptive statistics
| Standard deviation | 287640.85 |
|---|---|
| Coefficient of variation (CV) | 1.4299533 |
| Kurtosis | 35.088599 |
| Mean | 201154.02 |
| Median Absolute Deviation (MAD) | 70000 |
| Skewness | 3.9409921 |
| Sum | 1.8087045 × 1011 |
| Variance | 8.2737259 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50000 | 43787 | 4.9% |
| 100000 | 36714 | 4.1% |
| 25000 | 27387 | 3.0% |
| 150000 | 23373 | 2.6% |
| 10000 | 21328 | 2.4% |
| 35000 | 14748 | 1.6% |
| 5000 | 14193 | 1.6% |
| 75000 | 13528 | 1.5% |
| 20000 | 13462 | 1.5% |
| 30000 | 12696 | 1.4% |
| Other values (118849) | 677948 |
| Value | Count | Frequency (%) |
| 0 | 196 | |
| 1 | 11 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 3 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11446325 | 1 | |
| 11000000 | 1 | |
| 10465000 | 1 | |
| 9284449 | 1 | |
| 8995000 | 1 | |
| 8607858 | 1 | |
| 8602584 | 1 | |
| 7853275 | 1 | |
| 7699233 | 1 | |
| 7573881 | 1 |
ApprovalDate
Date
| Distinct | 9859 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
| Minimum | 1961-12-07 00:00:00 |
|---|---|
| Maximum | 2014-06-25 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
ApprovalFY
Real number (ℝ)
High correlation 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2001.1436 |
| Minimum | 1962 |
|---|---|
| Maximum | 2014 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 1962 |
|---|---|
| 5-th percentile | 1991 |
| Q1 | 1997 |
| median | 2002 |
| Q3 | 2006 |
| 95-th percentile | 2009 |
| Maximum | 2014 |
| Range | 52 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 5.9138459 |
|---|---|
| Coefficient of variation (CV) | 0.0029552332 |
| Kurtosis | -0.092531047 |
| Mean | 2001.1436 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.58537855 |
| Sum | 1.7993562 × 109 |
| Variance | 34.973573 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2005 | 77525 | 8.6% |
| 2006 | 76040 | 8.5% |
| 2007 | 71876 | 8.0% |
| 2004 | 68290 | 7.6% |
| 2003 | 58193 | 6.5% |
| 1995 | 45758 | 5.1% |
| 2002 | 44391 | 4.9% |
| 1996 | 40112 | 4.5% |
| 2008 | 39540 | 4.4% |
| 1997 | 37748 | 4.2% |
| Other values (41) | 339691 |
| Value | Count | Frequency (%) |
| 1962 | 1 | < 0.1% |
| 1965 | 1 | < 0.1% |
| 1966 | 1 | < 0.1% |
| 1967 | 2 | < 0.1% |
| 1968 | 2 | < 0.1% |
| 1969 | 4 | < 0.1% |
| 1970 | 8 | < 0.1% |
| 1971 | 20 | < 0.1% |
| 1972 | 27 | |
| 1973 | 52 |
| Value | Count | Frequency (%) |
| 2014 | 268 | < 0.1% |
| 2013 | 2458 | 0.3% |
| 2012 | 5997 | 0.7% |
| 2011 | 12608 | 1.4% |
| 2010 | 16848 | 1.9% |
| 2009 | 19126 | 2.1% |
| 2008 | 39540 | |
| 2007 | 71876 | |
| 2006 | 76040 | |
| 2005 | 77525 |
GrAppv
Real number (ℝ)
High correlation 
| Distinct | 22128 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 192686.98 |
| Minimum | 200 |
|---|---|
| Maximum | 5472000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 200 |
|---|---|
| 5-th percentile | 10000 |
| Q1 | 35000 |
| median | 90000 |
| Q3 | 225000 |
| 95-th percentile | 750000 |
| Maximum | 5472000 |
| Range | 5471800 |
| Interquartile range (IQR) | 190000 |
Descriptive statistics
| Standard deviation | 283263.39 |
|---|---|
| Coefficient of variation (CV) | 1.4700702 |
| Kurtosis | 21.018882 |
| Mean | 192686.98 |
| Median Absolute Deviation (MAD) | 65000 |
| Skewness | 3.5207901 |
| Sum | 1.7325719 × 1011 |
| Variance | 8.0238149 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50000 | 69394 | 7.7% |
| 25000 | 51258 | 5.7% |
| 100000 | 50977 | 5.7% |
| 10000 | 38366 | 4.3% |
| 150000 | 27624 | 3.1% |
| 20000 | 23434 | 2.6% |
| 35000 | 23181 | 2.6% |
| 30000 | 21004 | 2.3% |
| 5000 | 19146 | 2.1% |
| 15000 | 18472 | 2.1% |
| Other values (22118) | 556308 |
| Value | Count | Frequency (%) |
| 200 | 2 | < 0.1% |
| 300 | 1 | < 0.1% |
| 400 | 2 | < 0.1% |
| 500 | 33 | < 0.1% |
| 700 | 4 | < 0.1% |
| 800 | 4 | < 0.1% |
| 950 | 1 | < 0.1% |
| 1000 | 444 | |
| 1200 | 12 | < 0.1% |
| 1300 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 5472000 | 1 | < 0.1% |
| 5000000 | 40 | |
| 4991700 | 1 | < 0.1% |
| 4950000 | 1 | < 0.1% |
| 4908500 | 1 | < 0.1% |
| 4900000 | 2 | < 0.1% |
| 4872000 | 1 | < 0.1% |
| 4869000 | 1 | < 0.1% |
| 4830000 | 1 | < 0.1% |
| 4800000 | 1 | < 0.1% |
SBA_Appv
Real number (ℝ)
High correlation 
| Distinct | 38326 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 149488.79 |
| Minimum | 100 |
|---|---|
| Maximum | 5472000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 5000 |
| Q1 | 21250 |
| median | 61250 |
| Q3 | 175000 |
| 95-th percentile | 626250 |
| Maximum | 5472000 |
| Range | 5471900 |
| Interquartile range (IQR) | 153750 |
Descriptive statistics
| Standard deviation | 228414.56 |
|---|---|
| Coefficient of variation (CV) | 1.5279712 |
| Kurtosis | 25.325514 |
| Mean | 149488.79 |
| Median Absolute Deviation (MAD) | 48750 |
| Skewness | 3.6752753 |
| Sum | 1.3441494 × 1011 |
| Variance | 5.2173212 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25000 | 49579 | 5.5% |
| 12500 | 40147 | 4.5% |
| 5000 | 31135 | 3.5% |
| 50000 | 25047 | 2.8% |
| 10000 | 17009 | 1.9% |
| 17500 | 16141 | 1.8% |
| 15000 | 14490 | 1.6% |
| 7500 | 12781 | 1.4% |
| 127500 | 11946 | 1.3% |
| 80000 | 10965 | 1.2% |
| Other values (38316) | 669924 |
| Value | Count | Frequency (%) |
| 100 | 2 | < 0.1% |
| 150 | 1 | < 0.1% |
| 200 | 2 | < 0.1% |
| 250 | 33 | < 0.1% |
| 350 | 4 | < 0.1% |
| 400 | 4 | < 0.1% |
| 475 | 1 | < 0.1% |
| 500 | 442 | |
| 600 | 12 | < 0.1% |
| 650 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 5472000 | 1 | < 0.1% |
| 5000000 | 1 | < 0.1% |
| 4869000 | 1 | < 0.1% |
| 4582000 | 1 | < 0.1% |
| 4500000 | 23 | |
| 4492530 | 1 | < 0.1% |
| 4410000 | 1 | < 0.1% |
| 4320000 | 1 | < 0.1% |
| 4050000 | 4 | < 0.1% |
| 4000000 | 13 |
Interactions
Correlations
| ApprovalFY | CreateJob | DisbursementGross | FranchiseCode | GrAppv | IsFranchise | LoanNr_ChkDgt | LowDoc | MIS_Status | NAICS | NewExist | NoEmp | RetainedJob | SBA_Appv | Term | UrbanRural | Zip | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ApprovalFY | 1.000 | 0.268 | -0.222 | -0.452 | -0.300 | 0.644 | -0.278 | 0.375 | 0.327 | 0.447 | 0.054 | -0.226 | 0.546 | -0.366 | -0.297 | 0.659 | -0.038 |
| CreateJob | 0.268 | 1.000 | 0.110 | -0.054 | 0.093 | 0.046 | -0.031 | 0.010 | 0.012 | 0.157 | 0.003 | 0.034 | 0.377 | 0.078 | 0.082 | 0.025 | 0.026 |
| DisbursementGross | -0.222 | 0.110 | 1.000 | 0.204 | 0.965 | 0.030 | 0.102 | 0.051 | 0.031 | -0.124 | 0.021 | 0.445 | -0.070 | 0.936 | 0.521 | 0.040 | 0.115 |
| FranchiseCode | -0.452 | -0.054 | 0.204 | 1.000 | 0.259 | 0.131 | 0.392 | 0.036 | 0.022 | -0.091 | 0.139 | 0.121 | -0.263 | 0.285 | 0.196 | 0.013 | 0.031 |
| GrAppv | -0.300 | 0.093 | 0.965 | 0.259 | 1.000 | 0.099 | 0.139 | 0.116 | 0.074 | -0.147 | 0.050 | 0.455 | -0.138 | 0.986 | 0.558 | 0.051 | 0.119 |
| IsFranchise | 0.644 | 0.046 | 0.030 | 0.131 | 0.099 | 1.000 | 0.479 | 0.206 | 0.240 | 0.206 | 0.049 | 0.000 | 0.046 | 0.090 | 0.215 | 0.280 | 0.093 |
| LoanNr_ChkDgt | -0.278 | -0.031 | 0.102 | 0.392 | 0.139 | 0.479 | 1.000 | 0.247 | 0.237 | -0.050 | 0.086 | 0.075 | -0.142 | 0.169 | 0.121 | 0.189 | 0.031 |
| LowDoc | 0.375 | 0.010 | 0.051 | 0.036 | 0.116 | 0.206 | 0.247 | 1.000 | 0.084 | 0.154 | 0.161 | 0.003 | 0.010 | 0.097 | 0.169 | 0.213 | 0.145 |
| MIS_Status | 0.327 | 0.012 | 0.031 | 0.022 | 0.074 | 0.240 | 0.237 | 0.084 | 1.000 | 0.148 | 0.019 | 0.004 | 0.013 | 0.070 | 0.491 | 0.210 | 0.080 |
| NAICS | 0.447 | 0.157 | -0.124 | -0.091 | -0.147 | 0.206 | -0.050 | 0.154 | 0.148 | 1.000 | 0.132 | -0.154 | 0.271 | -0.175 | -0.081 | 0.432 | -0.034 |
| NewExist | 0.054 | 0.003 | 0.021 | 0.139 | 0.050 | 0.049 | 0.086 | 0.161 | 0.019 | 0.132 | 1.000 | 0.003 | 0.003 | 0.041 | 0.123 | 0.041 | 0.123 |
| NoEmp | -0.226 | 0.034 | 0.445 | 0.121 | 0.455 | 0.000 | 0.075 | 0.003 | 0.004 | -0.154 | 0.003 | 1.000 | 0.124 | 0.449 | 0.200 | 0.010 | 0.059 |
| RetainedJob | 0.546 | 0.377 | -0.070 | -0.263 | -0.138 | 0.046 | -0.142 | 0.010 | 0.013 | 0.271 | 0.003 | 0.124 | 1.000 | -0.205 | -0.157 | 0.025 | -0.026 |
| SBA_Appv | -0.366 | 0.078 | 0.936 | 0.285 | 0.986 | 0.090 | 0.169 | 0.097 | 0.070 | -0.175 | 0.041 | 0.449 | -0.205 | 1.000 | 0.589 | 0.051 | 0.131 |
| Term | -0.297 | 0.082 | 0.521 | 0.196 | 0.558 | 0.215 | 0.121 | 0.169 | 0.491 | -0.081 | 0.123 | 0.200 | -0.157 | 0.589 | 1.000 | 0.207 | 0.142 |
| UrbanRural | 0.659 | 0.025 | 0.040 | 0.013 | 0.051 | 0.280 | 0.189 | 0.213 | 0.210 | 0.432 | 0.041 | 0.010 | 0.025 | 0.051 | 0.207 | 1.000 | 0.126 |
| Zip | -0.038 | 0.026 | 0.115 | 0.031 | 0.119 | 0.093 | 0.031 | 0.145 | 0.080 | -0.034 | 0.123 | 0.059 | -0.026 | 0.131 | 0.142 | 0.126 | 1.000 |
Missing values
Sample
| MIS_Status | LoanNr_ChkDgt | State | Zip | UrbanRural | Bank | BankState | NAICS | NoEmp | NewExist | CreateJob | RetainedJob | FranchiseCode | IsFranchise | Term | RevLineCr | LowDoc | DisbursementGross | ApprovalDate | ApprovalFY | GrAppv | SBA_Appv | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1000014003 | IN | 47711 | 0 | FIFTH THIRD BANK | OH | 451120 | 4 | 2.0 | 0 | 0 | 1 | 1 | 84 | 0 | 1 | 60000.0 | 1997-02-28 | 1997 | 60000.0 | 48000.0 |
| 1 | 1 | 1000024006 | IN | 46526 | 0 | 1ST SOURCE BANK | IN | 722410 | 2 | 2.0 | 0 | 0 | 1 | 1 | 60 | 0 | 1 | 40000.0 | 1997-02-28 | 1997 | 40000.0 | 32000.0 |
| 2 | 1 | 1000034009 | IN | 47401 | 0 | GRANT COUNTY STATE BANK | IN | 621210 | 7 | 1.0 | 0 | 0 | 1 | 1 | 180 | 0 | 0 | 287000.0 | 1997-02-28 | 1997 | 287000.0 | 215250.0 |
| 3 | 1 | 1000044001 | OK | 74012 | 0 | 1ST NATL BK & TR CO OF BROKEN | OK | 0 | 2 | 1.0 | 0 | 0 | 1 | 1 | 60 | 0 | 1 | 35000.0 | 1997-02-28 | 1997 | 35000.0 | 28000.0 |
| 4 | 1 | 1000054004 | FL | 32801 | 0 | FLORIDA BUS. DEVEL CORP | FL | 0 | 14 | 1.0 | 7 | 7 | 1 | 1 | 240 | 0 | 0 | 229000.0 | 1997-02-28 | 1997 | 229000.0 | 229000.0 |
| 5 | 1 | 1000084002 | CT | 6062 | 0 | TD BANK, NATIONAL ASSOCIATION | DE | 332721 | 19 | 1.0 | 0 | 0 | 1 | 1 | 120 | 0 | 0 | 517000.0 | 1997-02-28 | 1997 | 517000.0 | 387750.0 |
| 6 | 0 | 1000093009 | NJ | 7083 | 0 | WELLS FARGO BANK NATL ASSOC | SD | 0 | 45 | 2.0 | 0 | 0 | 0 | 0 | 45 | 0 | 0 | 600000.0 | 1980-06-02 | 1980 | 600000.0 | 499998.0 |
| 7 | 1 | 1000094005 | FL | 34491 | 0 | REGIONS BANK | AL | 811118 | 1 | 2.0 | 0 | 0 | 1 | 1 | 84 | 0 | 1 | 45000.0 | 1997-02-28 | 1997 | 45000.0 | 36000.0 |
| 8 | 1 | 1000104006 | FL | 32456 | 0 | CENTENNIAL BANK | FL | 721310 | 2 | 2.0 | 0 | 0 | 1 | 1 | 297 | 0 | 0 | 305000.0 | 1997-02-28 | 1997 | 305000.0 | 228750.0 |
| 9 | 1 | 1000124001 | CT | 6073 | 0 | WEBSTER BANK NATL ASSOC | CT | 0 | 3 | 2.0 | 0 | 0 | 1 | 1 | 84 | 0 | 1 | 70000.0 | 1997-02-28 | 1997 | 70000.0 | 56000.0 |
| MIS_Status | LoanNr_ChkDgt | State | Zip | UrbanRural | Bank | BankState | NAICS | NoEmp | NewExist | CreateJob | RetainedJob | FranchiseCode | IsFranchise | Term | RevLineCr | LowDoc | DisbursementGross | ApprovalDate | ApprovalFY | GrAppv | SBA_Appv | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 899154 | 1 | 9995423005 | OH | 44405 | 0 | JPMORGAN CHASE BANK NATL ASSOC | IL | 0 | 1 | 1.0 | 0 | 0 | 1 | 1 | 60 | 0 | 0 | 10000.0 | 1997-02-27 | 1997 | 10000.0 | 5000.0 |
| 899155 | 1 | 9995453003 | NY | 11420 | 0 | FLUSHING BANK | NY | 624410 | 2 | 1.0 | 0 | 0 | 1 | 1 | 180 | 0 | 0 | 123000.0 | 1997-02-27 | 1997 | 128000.0 | 96000.0 |
| 899156 | 1 | 9995473009 | MD | 21224 | 0 | BANK OF AMERICA NATL ASSOC | MD | 332431 | 20 | 1.0 | 0 | 0 | 1 | 1 | 60 | 0 | 0 | 50000.0 | 1997-02-27 | 1997 | 50000.0 | 25000.0 |
| 899157 | 1 | 9995493004 | CA | 92020 | 0 | U.S. BANK NATIONAL ASSOCIATION | CA | 314912 | 40 | 1.0 | 0 | 0 | 1 | 1 | 36 | 0 | 0 | 200000.0 | 1997-02-27 | 1997 | 200000.0 | 150000.0 |
| 899158 | 1 | 9995563001 | TX | 75062 | 0 | LOANS FROM OLD CLOSED LENDERS | DC | 0 | 5 | 2.0 | 0 | 0 | 1 | 1 | 84 | 0 | 1 | 79000.0 | 1997-02-27 | 1997 | 79000.0 | 63200.0 |
| 899159 | 1 | 9995573004 | OH | 43221 | 0 | JPMORGAN CHASE BANK NATL ASSOC | IL | 451120 | 6 | 1.0 | 0 | 0 | 1 | 1 | 60 | 0 | 0 | 70000.0 | 1997-02-27 | 1997 | 70000.0 | 56000.0 |
| 899160 | 1 | 9995603000 | OH | 43221 | 0 | JPMORGAN CHASE BANK NATL ASSOC | IL | 451130 | 6 | 1.0 | 0 | 0 | 1 | 1 | 60 | 0 | 0 | 85000.0 | 1997-02-27 | 1997 | 85000.0 | 42500.0 |
| 899161 | 1 | 9995613003 | CA | 93455 | 0 | RABOBANK, NATIONAL ASSOCIATION | CA | 332321 | 26 | 1.0 | 0 | 0 | 1 | 1 | 108 | 0 | 0 | 300000.0 | 1997-02-27 | 1997 | 300000.0 | 225000.0 |
| 899162 | 0 | 9995973006 | HI | 96830 | 0 | BANK OF HAWAII | HI | 0 | 6 | 1.0 | 0 | 0 | 1 | 1 | 60 | 0 | 1 | 75000.0 | 1997-02-27 | 1997 | 75000.0 | 60000.0 |
| 899163 | 1 | 9996003010 | HI | 96734 | 0 | CENTRAL PACIFIC BANK | HI | 0 | 1 | 2.0 | 0 | 0 | 1 | 1 | 48 | 0 | 0 | 30000.0 | 1997-02-27 | 1997 | 30000.0 | 24000.0 |